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Seouenee T n fomiflri ftT1| 



Sequence ID No. l; ATR seq 
1 GCGCT 



151 ^TATAATAC/*nGTACA(^X^^ " 

226 TGTTGTTGCTGTAGMCTTCTAAAfiAWACTGACTCTCAGCCAACCTCCGTCATGTTGGTGA 300 

I^ T ^CaCCCCACnATGmGTAMTGTGAGT6GA^^^ ° 

451 T^aTCTOTammmCTTTTTAAAAGCAAi^Ca^ 52° 

SI ^^^^^ ™ 

751 GCAAGAACTCTTACTTTGGCAGATAGGTTGTGTTCTGCTAGAGTATGGTAGTCCAAAM gjc 
™ ^^AACAGAACTTTTTC^^ 

901 GGAATTATTAAAACACCTTGTAGAAATGGATACTGACCMTTGAAACTCTATGAAGAGC^ 97 c 

im SJ^ 7 ™* 1 ^^ 1050 

lit, t^ 1 * 771 ^"*™^^ 200 

1201 TATTTGTAAAGCTCTTTTGGATCTGCTTGGAATTG ^ 

1276 TTTGAAAATGGAAAGTATGGAMTCATT^TTCMTK 

351 TGATGGMTATCACCCAAAAGGCGTCGTaCAGCTCGT^ ^ 

150 GATTTCCCTTGAATACAB^ 500 

576 GACTGCTCTGTGTACTGnCAnGTTCTCATCAAAACATGAACTGCCGTACTTTCAAGGACTGTCM 65 o 
CAAG^CCTTCTGTAGTGftTAACTT^^^^ 

1726 TnGTTAGMTCTGTTCAGAAACTG(^CTGGAGGCAAXXAn ^0 
101 HAUTGOUGTAAACA^^ 

1951 nCAGATAGCTATTCACCACAGGCACMTCACGATGTGTGTTTCTTCTGACTCTGTTTCC^^ 2025 

2 26 ™™«A^^ ^ 

^ATmATTGCAGOG^TTCnGTAACAGAGnCCC^^ 2 5 
2176 nCTGACATTGT(^AGAAAGAATTTGCTTCTATACTTGGTCAACTTGTC7GTACT CTTCACGGCATGTTTTATCT 2 0 
2251 GACAAGTTCTTTAACAGAACCTnCTCTGAACAC 

™ I^™^^ 2400 

2401 ACCTAGTCCAGTAAMCnGCmCATAGATMTCTACATCAT^ 2475 

2476 AACAGATGTAAMGCAGTTCTTGGMCTTTAnAMTTTMTG^ 2550 
2551 TAGTGGAAATATCAAGCACATATTGGMTCCnGGACTCTGAAGATGGATTTATAA 

2626 AATGAAGGAAGCATATACACATGCCCAAATATCAAGAAATMTGAGCTGAAGGATACCT 2700 

2701 GMTATTGGAAGGGCCGCAAAAGGAGATTTGGTACCATTTGCACTCnAC^ ' 

2851 TTTTTTCAGCCAGTATAAGAAACCCAYCTGTCAGrrrTTGGTAGM 29^5 
3001 GnGTat^TTGCCMCGTTTTCGACmcCTGATCnMTCGTmcnACTAG^amC^GTTCTACT 3075 
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3076 ACCTGATCTTGCTGCCAAAGCAAGCCCTGCAGCnCTGCTCTCATTCGAACTTT AGGAAAACAATTAAATGTCAA 31SD 
3 51 TCGTAGAGAGATmMTAAACAACTTCAMTATATTTrTTCTCAmGGTC^ 2 

S ^^^TTOTGCGTATTGGAGAAIAaA^ £ ° 

3376 ATnGCATCCAGTGAT6ATCCATATCAGGGCCCGAGAGATATCATATCACCTGAACT6ATffiCT6An 

ACCCAAAnGTTGG^TTTTBGnTTTTnAACATa^ 



3676 TGTTCGCTGCCTBGATCATBCTTGTCTGGGCTCCCnCTCAGT^ 

390 WCCTCTGA^AGCACTGATCnCAGACA^ »? 

3976 TCGTATTCATGCTmACAAGCTOAAGGA^ JS 

4051 CAGTGAAACAGTAGAACCTATTATUrCACAGnGGTGACACTGCTTTTGAAA^ ^ 

4201 TGAAACTCAAGGAAAAGATTTTACATTTGTGACTGGAGTAGAAGATTCAAGCTTTGCC^^ 

4351 GCrnCTATnATGACTGTAGAGAGATGGAGACCAACGGCCCAGGTCACXMn 2Sc 
4426 ^CGGGAMTACTAGA^^ 4425 

rrTAAGTAAAnGGGTAGTMCTTTGCAGAATGGTCAGCATCTTGGGCAGGTT^ 
rGATCnGCWGTAAAATTTTCACCTGCTGTAGCATTATBATGAAGCATGATTTCAAAGT 
rCCACATATTCTGGTGTATtn"CTTACTG<KnGTAATCAAGAAGATCAGCAGGAG5TTTA 
^GTTCTAAAGCATGACGATt^GCATACCATAAATAGCCAAGACATTGCATCTGATCTGTG 
iACTGTGTTCTCCATBCTTGACCATCTCACACAGTGGGCAAGQCACAAAT^ 
CCACACAGCAMTCAAACAGAAATAAGGTAGACTCAATGGTATCTACTGTGGATTATGA 
ACCCGTTTTCTAGACCTCATACOCCAflGATACTCTGGCAGTAGCTTCCTTTCGCTCCAA 
GTMTGC^TTTGMTCATTTATTACAGAAAAGAAGCAAAATATTCAGGAACATCTTGG 

5251 GGCTATrCAGCTAQAACCAGACCAGATCATTCATTATCATGGTiin'ACT 5? 



5401 AGTGGAAGCAGCITGGAAAnGTCACAGTGGGATTTGGTBGAAMCTATTTGGCAGW EE 

5476 ATGGAGTGTCAGACTGGGACAGCTATTAnATCAGCCAAAAAAAGAGATATCACAGCrT^ £2 

5551 AaAGTGAGA«^AAC^ 5550 

Sf ^ T ^^ ATO » 6T1 ^T^ 5 5 7 6 o 

5851 GCTGCAGAGTGCCAGGGTAGCTAGAAAGGCTGGTCACCACCAGACAGCCTACMTGCTCTCCTrM^ 22 

22 iSSS?"^^ 6 0 

6001 TGTTnTCAAAAAGGTGTTGAATTATGTTncCTGAAMTGAAACCCCACCTGAGGGTAAGAAC^ 22 
6076 TGGTCGAGCTATGCTACTAGTQGGCCGATTTATGGAAGAAACAGCTMGTrGAAAGW 
f ATATAAGGATGT^CGCCTGCCTGCCAGMTGGGAG^ 

« ^CATGGTCACAGACAACAAMTGGAAA^ ^ 

6301 ATCTCTACAATATGGAMTCAGTTCATATATCAGTCMTGCCACGMTBnMCTCTATGGCT^ 22 

6451 GGTTATCACAGAGCATACAAACTATTTAKTCCATATCAAT " 5 ° 

6526 ™OOiaitt^^ * 
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6601 AGCMTGTGGATWTGACAGCTGTGTCAAAGTCATC7TATCCCATGC5TGT5AACAWTGCAAG6AAATCCTCAA 6675 

6676 TAAAGCTATTCATATGAAAAMTCCnAGAGAAGTTTC^ 6750 

6751 AnGTGCAATAMCCGGTTGATGGAAGTAGTTCCACATTAAG^ 6825 

6826 GGTAGAAGAAGCAACATTTAGTGAAATCCTCATTCCTCTAC^ 6900 

6901 GGGTACCCATGCTAAGCATGCTAGCCAT<^^ 6975 

6976 GGTGGAAATTOTGCTTCTCTTCAGAAACCAMGAAGATTTCTn 7050 

7051 GATfTGTAAGtEAAAAGATGACOGA^ 7125 

7126 AAGAAAAGATGCAGAGTCTCGTAGAAGAGAACTTCATAnCGA mo 

7201 TGGGAnATTGMTGGGTGAACAACACTGCTGGTnfiAGACCTATTCTGACCAAACTATATAAA 7275 

7276 GTATATGACAGGAAAttAACTTCGCCA^ 7350 

7351 CCGAGMTTTCTCCTGCCCAGGCATCCTCCTATTTrTCATGAGTGGTTTCTGAGMCATTCCCTG^ 7425 

7426 ATGGTACAGTAGTAGATOUXTTACTGCCGTTCCACTGCAGTAATGTCMTGGTTGGnATATTCTGG^ 7500 

7501 AGACCGTCATGCTGAAAATAnCTCTTTGAnCTnGACT 7575 

7576 CMTAAGGGAGAAACCTnGAAGnCCAGAAATTGTGCCATTTCGCCTGACTCATMTATGG 7650 

7651 TCCTATGGGAACAGAGGGTCTTTTTCGAAGAGCATGTGAA^ 7725 

7726 TTTMTGAGTGTCnAAAGACTTTTCTACATGATCC^ 7800 

7801 AGCGCCACTGAATGAMCTGGAGAAGTTGTCMTGAAAAGGCCAAGACC^ 7875 
7876 ACAAGGTGTMTCAAGACTCGAAATAGAGTGACAGGACTGCCGT^ 

7951 ACAAGAAGCTACTGATGAAAACTTACTATGCW^^ 8025 
8026 GTAAAAGAATATGTTAATMTCTAAMGTMTGCATTTGGTATGAATCTGTGGTTGTATCTGTTCMTTCTAAAG 8100 

8101 TACMCATAMTT7ACGTTCTCAGCMCTGTTATTTCTCTCTGATCATTMTTATATGTAAMTMTATACATTC 8175 

8176 AGTTATTAAGAMTAMCTGCTTTCTTMTAAAAAAAAAAAAAAAAAA 8239 

Sequence ID No. 2: ATR protein 

1 HGEHGLELASMI PALRELGSATPEEYNTVVQKPROI LCQFIDR I LTDVN VVAVElVKKTDSQPTSV 66 
67 MLLDF I QHIMKSSPLMFVNVSGSHERKGSC I EFSNW I ITRLLRIAATPSCHLLHKKICEVICSLLFLFKSKSPAI 141 
142 FGVLTKELLQLFEDL VyLHRSNVHGHAVEWPWMSRFLSQLDEHMGYLQSAPLOLHSMQNLEF I EVTLLMVLTRI 216 
217 IAIVFFRRQELLLWOIGCVLLEYGSPKIKSUISFLTELFOLGGLPAOPASTFFSSFLELLKHLVEMDTDQLICLY 291 
292 EEPLSKLItCTLFPFEAEAYRNIEPVYLNMLLEKLCVMFEDGVLMRLKSOLLKAALCHLLOYFLKFVPAGYESALQ 366 
367 VRKV YVRN ICKALLDVLG 1 EVDAE YLLGPLYAAL KMESME 1 IEEIOCOTOQENLSSNSOGISPKRRRLSSSLNPS 441 
442 KMPKOTEEIKHVDMNQKSILUSALKQMESLOISLEYSGLKNPVIENLEGIAVVLOLTALCTVHCSHQNMNCRT 516 
517 FKDCQHKSKKKPSVV I TlvWSLDFVTKVLKSCRSLLES VQKLOLEATIDK VVK I YOAL I YMQVNSSFEDH I LEDLC 591 
592 GMLSLPWIYSHSDDGCLKLTTFAANLLTLSCRISDSYSPQAQSRCVFLLTLFPRRIFLEWRTAVYNWALQSSHEV 666 
667 IRASCVSGFFILL0Q0NSCNRVPKIL10KVKDDSDIVKKEFASILG0LVCTLHGMFYLTSSLTEPFSEHGHVDLF 741 
742 CRNLKATSQHECSSSQLKASVCKPFLFLLKKKIPSPVKLAFIONLHHLCKHLDFREDETDVKAVLGTLLNLMEDP 816 
817 OKDVRVAFSGN I KH ] L ESIDSEDGF I KE LFVLRMKEA YTHAQ I SRN.MEL KDTL I LTTGDIGRAAKGDLVPFALLH 891 
B92 LLHCLLSKSASVSGAAYTEIRALVAAKSVICLQSFFSOYKKPICQFLVESLHSSQWTALPNTPCONADVRKODVAH 966 
967 OREMALNTISE IANVFDFPDLNRFLTRTLOVLLPOLAAKASPAASAL IRTLGKQLNVNRRE I L I NNFKY IFSHLV 1041 
1042 CSCSKOELERALHYLKNETEIELGSLLRODF03LHNELLl.RIGEHYOOVFNGLSILASf\ASS[)DPYCX3PR01ISP 1116 
1117 ELHAOYLQPKLLGILAFFNMQLLSSSVGIEDKKMALKSLMSLKKLMGPI^VSSVRVKMHTTLRTGLRFKDDFPEL 1191 
1192 CCRAWDCFVRCLDHACLGSLLSHVI VALLPL IHI QPKETAAI FHYLI IENRDAVQOFLHE I YFL PDHPELKK I KA 1266 
1267 VLOE YRKETSESTQLOTTLOLSNKAI OHENVDVR I HALTSLKETLYKNQEKL 1 KYATDSETVEP I ISOLVTVLLK 1341 
1342 GC0DANS0ARLLCGECLGELGAIDPGRLDFSTTETO3KDFTFVTGVEDSSFAYGLLMELTRAYLAYA0NSRA0DS 1416 
1417 AAYAIQELLSIYDCREKnHGPGHOLWRRFPEHVREILEPHLNTRYKSSOKSTDWSGVKKP[YLSKLGSNFAEWS 1491 
1492 ASWAGYL ITKVRHDLASKI FTCCS I K.MKKDFKV1 I YLLPH I LVYVLLGCNOCDOQE VYAE1 MAVIKHDDOHTI NT 1566 
1567 0DIASOLCQLSTOTVFSKLOH.T0WARHICFOAuKAEKCPHSlCSNRKKVOSMVSTVDYE0YOSVTRFLDL IPODTL 1641 
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1642 AVASFRSKAYTRAVMHFESFITETOJMQEJ&GFLQICLYAAMHEPrc^ 1716 

1717 DATACYDRAIQLEPDQIIHYHWVICSMLGim 1791 

1792 AADGKSTTfcSVRLGQLLLSAWCRDITAFYD^ 1B66 

1867 LFQHSPGDSSQEDSLmARLEMTQfEYRAXEPIU\^^ 1941 

5 1942 NALUIAGESRLAELYVERAKWLUSKGDVHQ^ 1016 
2017 ESWAI KKKYKD\n"ACLPEWEDGHFYLAKY YDKLMPMVTDNKMEKQGDL I RY I VLHFGRSLQYGNQF I YQSNPRMl 2091 

2092 TLWlDYGTKAYEWEKAGRSORVQ^DLGUm 2166 

2167 VFU\YPQQAHMtfTAVSKSSYPMRVWRCKEILN^ 2241 

2242 HFIttlKiaVEEATFSEILIPLQSVmPTLPSILGTH^^ 2316 

10 2317 SDGKFYIKMCKPKOOLRKDCRlMEFNSLINKCLRKDAE^ 2391 

2392 KLYKEKGVYKTGKELRCmPKSAALSEKLKVFREFLUW 2466 
2467 VGYI LGLGDRHGEN I LFDSLTGECVHVDFNCLFNKGETFE VPE 1 VPFRLTHNMVNGHGPHGTEGLFRRACEVTWR 2541 

2542 LMRDQREPLHSVliaFLHDPLVEVfcKPVKGHS^ 2616 

2617 EGHVHYLIQEATDENLLCQMYLGWTPYM 2664 

15 Sequence ID No. 3: ractt.seq 

1 GGTACCAAGTAAAAACTGCnAGTAAGTATAAAACACAGAAGAATCCGCGATCT AGTGAACCAATGCCCTGCGTA 75 
76 TGACGCTCCACTGACGCTATAGTCMTGAGAACTAGGAT^^ 150 
151 TATTTATTTAAGAAATMnGMnAAAACTCATTTC^ 225 
226 ACTACCTTTCAACAATAAC 1 1 1 1 1 1 1 1 1 1 GTTTAnGACCATTATMTCACATCAAAAGTCAAAAAATTCAATCA 300 
20 301 nATCAGAMCATCCAGCaMTATTACTTAA^ 375 
376 ATTAGCATCGCTCGATACnAGTCCACXATGCATCTTCCTTTACCTGGTGAGTGGAAATC^ 450 
451 TGCCACTTTTCGCATAATTCTATTGAGATATTTCAT^^ 525 
526 CGCGCGTGTTGCGTmAAAAAGGCCTTTTmGAATTGMTCAATGGm 600 
601 AAAGGAAAGCTGGGTCACTCGATCmCACCCAG^ 675 
25 676 AACTATTAGCATTAGACAAAGAACATGAGTTAGGTAGAAGTM 750 
751 AAGTTTT AATTGAAGnGGTCnCTAGCTTTTAAACATGAT^ 825 
826 TAAAAGAAGCCCATCTCTCTCT ACAAGCGTTAATGCTAATCTT AAAMGGTCTCCGACAGTTTTGCGGGAGATTA 900 
901 MTCATCTGTTACTCTTTTGGAnGGATTTTACC^ 975 
976 TATTTGACTCATTAAAAGAGTTTCATAAGCTMTTTATCAGCTMTCAGTGAAAA 1050 
30 1051 ATGCTTCGTTTATGCGTTAnGGAAATATTATAHACA H25 
1126 CATTCCCnACAAGATGCCCTCACCCMnCTCA^ 1200 
1201 AGGACAMTTT6ATTTACnATCATTAATATAGAGGAGGCTTGTAC A 1 1 1 1 III 1 1 G AAAGTGCCCATTTTTTT6 1275 
1276 C ACAATGC TCATATTTAMGAAATCCAATTTTCCTACT 1350 
1351 GTTT7TI 1 AATTTTGnATTnAnAAAACGMTCAGCATCGGAGACT 1425 
35 1426 GTATAGTCCAMCTT^ATGCTGTTTTTCCTTGMTTTrATATATCATGGCCncCK 1500 
1501 ATATTTTAATCTCacaTCAAOT I575 
1576 CTCl M f (l 1 1 CTCTCTTTATnGTTTTACC^AAArrGTAGCTGGTCTATTCT ATCCTTTTGGGGT7TCCTTAC 1650 
1651 TTTCTGACnCAAGGTATTAGAGCMCTTGAACCAGATTCTGATCTCAAAAAGGCAATW 1725 
1726 GATACCAAAGTTCAGAMTAGATCAAACAACTCTCCGTGCTTnGG 1800 
40 1801 CGI (Gl 1 1 1 CTAACTCTGAATTAAACL 1 1 1 1 1LI 1 1 1 ACATTATCTTTCCTTGGACAATGACrTGTCAAATATTC 1875 
1876 nAAAGTGGATTTC^AATGGTCATAACATATGTACATTTGCAAMTGGTGTATAAA 1950 
1951 CGTCTMTTTAAAGCACTTTCGTGAMTGTTA3ATO 2025 
2026 TGAAGMCTTCTCTTTAGTTnGTGTACTCATGnGCAAAGGTGAATGAGAAAADUATAGT^ 2100 
2101 ATGAAGTACATGGTJGTGAAGmGTAACTCATTTTGTTTACTATT^ 2175 
45 2176 ATCAOGAATTGTTT^GTGCAnGCTAAAAAATCCCGACATAAT 2250 
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2326 TGAAAGCATTAGCCAGTACTTCAAGATffTn^ 

2476 £ 
2476 TGGAAAA TACAGCTA TTTTAGAAACGGTCA TTCTTTCCTGGA TCTCCTTATCTAGAGTGGTAGAAGAAS/^AAI ?Zn 

22 ?J^ GTACTAn ^™ TATCn ™™ T ™™^ SL 

2701 TTGCGA TTGTCCAAGGTA TGGGTAAAAMCCGAACA TAGCCAGTTTA TTTGCTCAGCTTA TGAA TA JTTCC^^ 

% 52^^ So 

2851 GTA TAGCTGAACTTTCACAAAGTGA TGTTGCTACTTTGTGCCTTACCAA TA TGCA TAAAA TCCTTGCTTCGCTA£ 

3076 TTCCTCA TGAAMAA TTGAAAATGCTTTAAGAAAGGTAGCAA TGA TTGTCTCTCAAGTGGTJAA 7G4GS44fi4rr 32 Z 
3151 TGAGCAA TAAGGAA TTA C TTTA TGA TTTTTTTAA TAA TCACA TTTTGGGTA TCTTAGC^^AATTnC^A^TCC 3??l 
3226 rTA^GACamASSAAAGACmM 

.3301 TA TGTGGAGGTGCAGTCAAACTTGGA TTACCACAGATACTTTCTAA TTTACAMGTG^TTTTCAA^TGA^^CT 2£ 

3676 '"TTAAGKmMCTAAWMTOGCCAGITnmA ?2 

m SSSS^^ 3975 
3976 ACGGAGAAGAMGTTTGAAGTTTATTCTAGA TTTTA TGCAATCGCAGTTAA TTCCAGCTTTCCTTGTTACTACTG ioso 

4126 TTMTMAAAAMGG^CTAMTGTGGTMCAIMCATT^^ Iff? 

4361 A^^TA^T^r^-r^rr^ T^GTTGTTTTCTCTTAAA TTGA TGGAGTACGCCCA TTCGCAAAACGt^AAA Zo 

4426 TTTTAAA TGTTA TTTTAACCGAGTCAGAACTGGAAGTTAA TAAAGTCA TTGAAGAA TTCCAGCTTGTTA TTAA TC Ysnn 
4501 AACCGGGACCTGA TGGA TTAAA TTCCGTGGGGCAACAAAGA TACACCTCATTTGTj^^^^rTTTT^MG^TTG 4576 

Sf 4650 
<6S1 AGAACCGTTA TA TGTCGGTGGAAGATGCTACCTCTCGAGAA TCA TCGA TCTCAAAAGTTGWTCATTTCTTffiC 
4726 "TTKCnCAAAJ^nAGGTAnGTCTCmAMTTG™ *™ 
4801 AACACA TACGTAA TGCTACAGCTCCA TA TGCAGCTTTAGAGTCCGATTA TAGAGTTTTGG^^AA^ATA^WCTG Zl 
4876 GAA TTGATGA TCCAGA T6AAA TCGAAGCAGTGTCTTTAAA TTTCCATGA TTACTCGTTTG^MCAACTC^TTT Z 0 
A^Irrrr^ ^^^^^^^^^^^^^GTTGTTACGAAA TTATTATTCAAAAGGKTO^&^AtfA 
AAAAGGCGAAAA TCGGTTTGCTTAACAGCA TGCTGCAA TCGGGGCA TTA TKAA TrrrTTCTTTTei*mTit* -!t 



5026 
5101 



AUAM^nSES^T ^ Cm " Gn ™^™™TrCAJ^GOATCaGAAMTA 5025 
AAAAGGCGAAAA TCGGTTTGCTTAACAGCA TGCTGCAA TCGGGGCA TTA TGAA TCTCTTGTTTTGAGTTTAGA TT d on 

5176 CTA TTGATTCGTTAAAAAAGT67CTT1CAAAAA6CAACT)GGAA TCTTTCGAAGCTAAA TTGGGTAGCATAT7TT IIZ 
5251 ACCAA TACCTACGGAAGGA TTCTTTTGCTGAA ^(^CGGAGCGGCTGCA^^TG^^GT^^TG^TG^T^CAG T 2S 
5326 CMTTGCAAACACAGGCGCCCA TTCAGCCTA TGA TTGTTA TGA TA TTHATCTAAGCTGC^^^MTT^AT^CT 5400 

5476 TAGCTCC6TACGGTAAA TTCAAGCACCAAA TCCTG7CCACTCACTTAGTTGGCTATGAAAAA TTTGAA^ACACGA 5550 
5551 ^CTGCTGAAATATATCTCGAGAmCAAGAATATCTCGAAAAM 

5ll ZSS^^™^ 1o 
5701 AACA TC6TAAAGCTA TTTCTGAATTGAA TTTTTCGCTTAA TAACAACA TGTTTGA TTTGGTTGA TGAGCA TGAAG 5775 
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5776 AAAGACCTAAAAA TCGTAAAGAAACTTTAGGAM TCCACTTAAAGGAAAAGTGUCTTGAAACTTACAAAA 7EGC 5850 
5651 TCGGAAAAGCTGGCCAACTGGGA TTGAAGGA TTTGGAGACGTA TTA TCATAAAGCGGTAGAGA TTTAQTCAGAAT 5925 
5926 GTGAGAA TACGCA TTA TTA TCTTGGCCA TCA TCGAGTTTTAA TGTA TGAAGAAGAACAAAAGCTCCCAGTTAA TG 6000 
6001 AACAGAGCGAACGA TTTTTAAGTGGTGAGTTAGTAACTCGCA TAATTAACGAA TTTGGTCGA TCTTTGTACTA TG 6075 
5 6076 GTACAAA TCA TA TA TA TGAAAGTA TGCCAAAA TTGCTCACAC TGTGGCTTGA TTTTGGGGCCGAAGAACTTCGCT 6150 
6151 TATCTAAAGATGACGGCGAAAAGTACTTTCGTGAACACAUATCTCTTCGAG 6225 
6226 ATTC(MTGTTTGKGCCTn'CTATGAAMTTCCTCAATAC 6300 
6301 GAGTA TGCCA TCCAAA TAA TAAAGTTTA TAAAA TTTTQGAACA TA TAA TTGCAAACGTTGTAGCA TCTTATCCTG 6375 
6376 GGGAGACGCTA TGGCAA TTAA TGGCAACAA TAAAA TCGACTTCTCAAMGCGCTCGCTTCGTGGAAAAAGCA 777 6450 

10 6451 TAAA TGTTTTACATTCTAGGAAGCTTTCTA TGTCTTCCAAAGTTGA TA TAAAAGCACTCAGTCAA TCTGCAA TK 6525 
6526 TCA TTACTGAAAAGTTAA TCAA TTTGTGCAATACAAGGATTAACAGTAAATCTGTAAAAA TGAGCTTAAAGGATC\ 6600 
6601 ATTTrCGGCTTTCTTTTGATGATCCGGTAGATTTACT 6675 
6676 CTAAAGATIOAACAGAGCTAGTCATTATCCATTTC^ 6750 
6751 JGGATATWGAACTCTCnCAAAAACCAAGWAAAGtTUrACG^ 6825 

15 6B26 JtcJGCAAACCCAAAGATGATCTTTGTAA 6900 
6901 GGAAAGATCAAGAAGCGAACAGAAGGAACTTGTGTATTAGAACnA 6975 
6976 GA7TTATCGMTGGGTAMTCATACTO 7050 
7051 CCATATCATATCAAGAAATCAAAGTTGATTTAGACTTTTC 7125 
7126 AGAAAATCnACCGAMmOT 7200 

20 7201 TTACT AGTAGACAAAACTAnGCCGAACTTTAGCAGTMTGTCAATAGTTGGCT 7275 
7276 GCCATGGCGAAAACATATTGTTTGATGAATnACAGGTGAAGCTATCCATCT 7350 
7351 AAGGTCnACTTTTGAAAAACCTGAAAAGGTGCCGnCAGATTAACTCA 7425 
7426 CAGGTTATGAAGGGGGTTTCAGGAAAGCTAGCGAAATAACGATGCGGOT 7500 
7501 TGAGCGTACTAGAGTCTTTCCTACATGATCCTTTAGTO 7575 

25 7576 ATAATGMGCAAATGAAGT7TTGGATATAATTCGCAAAAAATTC 7650 
7651 TATCTATTGAAGGGCAAATTCAAGAAT^^ 7725 
7726 GnGGGCTGCnAmOTAGCATmACTMC 7800 
7801 TGAATCGAAGATATTTrATTTATTMTCCGATGAAGAATTCTCGCTGAGn 7875 
7876 CCATTTCTAAATCGTCGATTCGCTTAAA 7950 

30 7951 AnGAAAAAAAGCTATATCTGTTTCTGAGTCATCATCC^ 8022 

In italics, sequenced by Seaton et aL 

In Bold are those bases deleted in Seaton et al. (2499. 22501. 2507. 2509) 
Underlined are the two bases either side of a single C insert (5918/5919) in Seaton et ah 
(i.e. the incorrect base not shown, but the one residue either side is) 

35 Sequence ID No. 4: rad3 protein 

1 HS0HAKRJCAGSL0LSPRGLDDROAFGQLLKEVLAU)KEHELGI^ 75 

76 ISPKMLKEAHLSLQAI^^ 150 

151 FLVTOLYASFMRYViTCYYITW 225 

226 SAHFFAQCSYLKKSNFPSPPLFT^ 300 

40 301 CEKSKHILJtSSINLTLGSLKKTYTVAOT 375 
376 I LFKCRYQSSE I DQTTLRAFGE I CTGKLENTLFSNSELNL Ft LHYLSLDNDLSN I IK VDFQNGHN I CTFAKWCI N 450 

451 NNLDEPSNLKHFREMLDYYSSHNVTISE 525 

526 PFKIPYHELFCALLKNPDIISSSVKQSLLLK^ 600 
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601 inKGPmDtVEFHKESKAU^ ^ 
676 filCLSALQQIASTIWISViaiSPYlimsVAIVQB^ m 
751 KALI VRIAELSQSDVA TLCL TNWILASLL TTDWNLEESVMLLLSLA TSDFEKVDLTSLLRSDPISI TVELLO 825 
i W^V^HENALRKVAMIVSQVYNDEDLSNKELL YDFFNHHIL6ILAEFSN1LNDLK6KT5INEKIKTI VGI 900 
976 LVlQlFDFlSSDTHKClOGtKUATPTXthCircctvArrTr™^^.—.— 



751 
826 
901 



... L VIQ1FDF1SSDTHKCLQGIKHA IPTSLDSACFSLKAKEIFCSLQNEDFYSELQSIIKCL TNENEPVCYLGL^KL 1050 
1051 affDmUTO^^ j5 
1126 LDNFENGEESLXFILDFNQSQL 1PAFL VTTD TKAQGFLA YALQEFLKLGGFKSA V1HKKKGL TVYTEfMISLPDL 1200 
1201 SKRVUPFL TSKYHL TP1PK10IRYP1 y^^IHJWOlfSLWCYAHSQtlAEKlFGICSKt^^OEnl^^ £J 
1276 U.PFLVLmLKSELEVmiEEFQLv m PGPOGLmVGQQRYlSFW^ j£ 
1351 A IARKENRYHSVEDA TSRESSISXVESFLSRFPSKTL CI VSLNCGFHARALFYWEQHIRNA TAP YMLESDYRft. 14X 
1426 QMAGlim>EIFAVSU<FWYSFl^LmSGmW 
1501 USLDSFIimWEYSKMLKGIEASmLSIDSLKKCLSKSN^ 

J« !!!^^ 1650 

1725 



VOAATAim^YDCWILSKLHAimFSRlAFmiYSDNLDIVLRRRLSOVAPm 
KFENTKKTAEI YLEIARISP.KUGQFQRAFNA I LK^WKPLATlEMWiM&GQHRKAISELNFSLNNNMFOL 1 725 

i™ JSSS^^ boo 

1801 KLPVNEGSERFLSGELV7TU INEFGRSLYYGTNHI YESMPKLLTLWLDFGAEELRLSKDCX3EKYFREHI ISSRKK 875 



w. • • u i in 1 1 1 unr*u. i LNLurtaftttUOiKIXJSEKTFREHI ISSRKK 

SLELMNSWCRLSMKIPQYFFLVALSQMISRVCHPNNKVYKILEHnAW^^ 
RGKS 1 LNVLHSRKLSMSSKVDI KALSQSAI L I TEKL INLCOTlttNSICSVKMSLKOHFRlSFDDPyDL VI PAKSFL 



■ -w. ui-r^r tut v u i nniujKr kf> v r VKfc I IXiNL YPRLCKPKDDLRKDARLMEFNNL 210D 

2101 lOCILRKDOEAMRRNLCIimrWIPLNEECGFIEMVimPTC ™ 
2176 <BIFEKKILPKFPPVREWFVESFPEPNNWVTSRW 

2251 NCLFDKGLTFEKPEKVPFRLTIWVaAMGPTGYEGGFRKASEITMRLLRSNQ^ 2325 
2326 SSKYPNNEANEVLDIIRIOCFQGFMPGETIPL^IEGQIQELII^SAVNPKNLVEMYIGHAAYF 2Mg 



